Recent Work in the Document Image Decoding Group at Xerox PARC
Authors
Abstract
Speed Enhancements to DID (Section 2)
When Document Image Decoding (DID) was proposed [15], its attractiveness lay primarily in its potential for high recognition accuracy, owing to its communications-theoretic framework and its well-defined models and objective function (posterior probability). Its initial implementations, however, suffered from high computational cost relative to commercial OCR methods. We summarize recent progress in reducing that cost. Importantly, these speed enhancements do not come at the expense of accuracy; they are guaranteed to produce the same recognition output as DID without the enhancements.
Similar Resources
Speech and Text-Image Processing in Documents
Two themes have evolved in speech and text image processing work at Xerox PARC that expand and redefine the role of recognition technology in document-oriented applications. One is the development of systems that provide functionality similar to that of text processors but operate directly on audio and scanned image data. A second, related theme is the use of speech and text-image recognition to...
Document image decoding in the UC Berkeley Digital Library
The UC Berkeley Environmental Digital Library Project is one of six university-led projects that were initiated in the fall of 1994 as part of a four-year digital library initiative sponsored by the NSF, NASA and ARPA. The Berkeley project is particularly interesting from a document image analysis perspective because its testbed collection consists almost entirely of scanned materials. As a res...
Model-Directed Document Image Analysis
If current OCR engineering trends continue, then, we believe, "general-purpose" systems (that is, fully automatic and nonretargetable systems) will leave many potential users unsatisfied, and lucrative application niches unfilled, for years to come. However, for users who care enough to volunteer some manual effort, to help customize the system to their document(s), significantly higher accura...
Integrated Services in the Internet Architecture: an Overview (Network Working Group, Request for Comments: 1633, Category: Informational; R. Braden, ISI; D. Clark, MIT; S. Shenker, Xerox PARC; June 1994)
This memo discusses a proposed extension to the Internet architecture and protocols to provide integrated services, i.e., to support realtime as well as the current non-real-time service of IP. This extension is necessary to meet the growing need for real-time service for a variety of new applications, including teleconferencing, remote seminars, telescience, and distributed simulation. This me...
Publication date: 2001